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Abstract 

A nonholonomic system, for short "NH", consists of a configuration space Q", a Lagrangian L{q,q,t), a 
nonintegrable constraint distribution 7i C TQ, with dynamics governed by Lagrange-d'Alembert's principle. 
We present here two studies, both using adapted moving frames. In the first we explore the affine connection 
viewpoint. For natural Lagrangians L = T — V , where we take V = for simplicity, NH-trajectories are 
geodesies of a (non metric) connection Vnh which mimics Levi-Civita's. Local geometric invariants are 
obtained by Cartan's method of equivalence. As an example, we analyze Engel's (2-4) distribution. This 
is the first such study for a distribution that is not strongly nonholonomic. In the second part we study 
G-Chaplygin systems; for those, the constraints are given by a connection (p : TQ — > Lie{G) on a principal 
bundle G^Q^S = Q/G and the Lagrangian L is G-equivariant. These systems compress to an almost 
Hamiltonian system {T* S, H'^ ^nh = ^can + (J-K), with d{J.K) 7^ in general; the momentum 
map J : T*Q — > Lie{G) and the curvature form K : TQ — > Lie{G)* are matched via the Legendre 
transform. Under a, s £ S dependent time reparametrization, a number of compressed systems become 
Hamiltonian, i.e, ^Inh is sometimes conformally symplectic. A necessary condition is the existence of an 
invariant volume for the original system. Its density produces a candidate for conformal factor. Assuming an 
invariant volume, we describe the obstruction to Hamiltonization. An example of Hamiltonizable system is 
the "rubber" Chaplygin's sphere, which extends Veselova's system in T*SO{3). This is a ball with unequal 
inertia coefficients rolling without slipping on the plane, with vertical rotations forbidden. Finally, we 
discuss reduction of internal symmetries. Chaplygin's "marble" , where vertical rotations are allowed, is not 
Hamiltonizable at the compressed T*S0{3) level. We conjecture that it is also not Hamiltonizable when 
reduced to r*^^. 
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"Nonholonomic mechanical systems (such as systems with rolhng contraints) provide a very interesting 
class of systems where the reduction procedure has to be modified. In fact this provides a class of systems 
that give rise to an almost Poisson structure, i.e, a bracket which does not necessarily satisfy the Jacobi 
identity " (Marsden and Weinstcin [2001]). 



1 Introduction and outline 

Cartan's moving frames method is a standard tool in Riemannian geometry^. In Analytical Mechanics, the 
method goes back to Poincare [1901], perhaps earlier, to Euler's rigid body equations, perhaps much earlier, 
to the caveperson who invented the wheel. Let q G M"' be local coordinates on a configuration space Q"", 
and consider a local frame, defined by an n x n invertible matrix B{q), 

i=l 

In Mechanical Engineering, Hamel [1949], Papastavridis [2002], moving frames disguise under the keyword 
quasi- coordinates, nonexisting entities tt such that 

^-V^^-V^?> -Y(f) 
dT,. - 2^ dqi avr, " ^ dqi ~ '^^^ 



Let {ej}i=i_...^n be the dual coframe to {Xj}, ei = ''^ dni" = ^ ■ aij dq 



^ ■ 

1.1 Moving frames: Lagrangian and Hamiltonian mechanics 

The Euler-Lagrange 1-form rewrites as^: 

where L*{q,7r,t) = L{q, B{q)7r,t) is the Lagrangian written in "quasi-coordinates" and = Fgbsk are 
the covariant components of the total force (external, F^xt, and constraint force A). The so called Hamel's 
transpositional symbols Ykj — l]k ~ S"^=i ^skbij {dois/dqi — dau/dqs) are precisely the moving frame 
structure coefficients (Koiller [1992]). 

If the velocities are restricted to a subbundle TC C TQ, a constraint force A appears. D'Alembert- 
Lagrange principle^ implies that A belongs to the anihilator Tl° C T*Q of Ti, hence exerting zero work on 
admissible motions q ^7i: 



^Cartan [1926]; there is a recent English translation from the Russian translation (Cartan [2001]). One of the most important 
applications was the construction of characteristic classes by Alan's adviser, S.S. Chern. Our taste for moving frames in 
Mechanics is a small tribute to his influence. 

Atributed to Hamel, but certainly known by Poincare. Quasi-coordinates can be found in Whittaker [1937] and were first 
used in Mechanics by Gibbs, see Pars [1965]. 

''According to Sommerfeld [1952], this gives the most natural foundation for Mechanics. 
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Using moving frames, constraints can be eliminated directly. If 71° is spanned by the last r forms 
ej, s + 1 < J < n (s = n — r), then equations of motion result from setting the first s Euler-Lagrange 
differentials equal to zero: 

^l^-'i^,^t%t^h^'-^i"-» a<^-<»). (1.4) 

4 = 1 j = l 

Strikingly, the Hamiltonian counterparts of (1.2) and (1.4) are simpler, although less known^. The 
philosophy is to fight against Darboux's dictatorship. In terms of the local coframe {ei}i<i=i<n) any 
element pq G T*Q can be written as = ^ miei{q). The natural 1-form a on T*Q keeps the familiar 
confusing expression a := pdq = me . Consequently, the canonical symplectic form := da writes as 

:= dp f\dq = dm f\e + mde . (1-5) 

The second term mde, which deviates from Darboux's format, is not a nuisance, it carries most valuable 
information. For instance, Kostant-Arnold-Kirillov-Souriau's bracket in T*G, G a Lie group, can be imme- 
diately visualized: take a (left or right) invariant coframe and apply H. Cartan's "magic formula" on de. 
So moving frames are ideally suited when a Lie symmetry group G is present^. 



Example: mechanics in 5*0(3). To fix notation, we now review the standard example. The Lie algebra 
basis Xi S sO(3) = TiSO{3), i = 1,2,3 (infinitesimal rotations around the x,y,z-a,xis at the identity), can 
either be right or left transported, producing moving frames on S0{3) denoted {X^} and {X-} respectively. 
Let {pi}i<i<3 and {Aj}i<j<3, denote their dual coframes (right and left invariant forms in 50(3)). To 
represent angular momenta, we use Arnold's notations (Arnold [1989]): capital letters mean objects in 
body frame, smallcase objects in the space frame. Thus for instance, £ = RL, where L is the angular 
momentum in body frame and £ is the angular momentum in space; likewise to = RQ relate the angular 
velocities. The canonical 1-form in T*SO{3) is given by 

a = iipi + £2P2 + hPz = LiXi + L2A2 + L3A3 

so 

^can = ^ d£ipi + £idpi + l2dp2 + isdps = ^ dLiXi + LidXi + L2dX2 + Lac^As, 

where by Cartan's structure equations, dXi = —X2 A A3, • • • and dpi = /32 A ps, • • • (cyclic). A left invariant 
metric is given by an inertia operator L = AQ. Euler's rigid body equations follow immediately. 

Poisson action of 5^ on 50(3). Consider the left action on 50(3) given by exp{icj}) ■ R := S{(j)) R 
where 5(0) is the rotation matrix about the z-axis: 

/ cos(<^) -sin(0) \ / -1 \ 

S{(j)) := sm{^) cos{^) , 5(-0)5'(</.) =10 = X3 . 
\0 1/ \000/ 

*A "moving frames operational system" for Hamiltonian mechanics in T*Q was given in Koiller, Rios and Ehlers [2002]. 

^As we learned from Alan at the banquet, the etymology for symplectic is "capable to join", themes and people. The latter 
is one of the most important aspects of the symplectic "creed". Provocation: taking moving frames, adapted to some other 
mathematical structure for Q, would the non-Darboux term provide a local symplectic invariant? 
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Two matrices are in the same equivalence class iff their third rows, which we denote by 7, called the Poisson 
vector, are the same: So we have a principal bundle vr : SO {3) — > S , 

7 = tt{R) = R~^k = R^ k. The derivative of vr is 

= vr,(^) = -{R~^RR^^)k = -{R^^R){R-^)k = -[n]-f = -Qx-f = -fxn (1.6) 

where we used the custumary identification^ [Jl] G sO(3) ^ Q G M^, Arnold [1989]. The lifted action to 
T*SO{3) has momentum map J = £3. 

Connection on ^ 50(3) — > S'^. Take the usual bi- invariant metric << , >> on SO{3) so that both 
{X^} and {X^} are orthonormal moving frames. The tangent vectors to the fibers are {d/d(/))S{(f)) ■ R = 
^rig/xt^ Consider the mechanical connection associated to << , >>, namely, horizontal and vertical spaces 
are orthogonal. The horizontal spaces are generated by x"^^^ and Xl^^^^ . The connection form is (f) = p^. 
The horizontal lift of 7 to i? is the tangent vector R such that 

^hor = R-^R =[7X7] (1-7) 

Note that ^hor is the -90 degrees rotation of 7 inside T^S"^. The curvature of this connection n = dp^ is 
the area form of the sphere. 

Reduction of 5^ symmetry. It is convenient for reduction to use (0,^3), a G IR? , a _L 7 , 

L:= ax 7 + ^37 (1.8) 

where a is a vector perpendicular to 7. The vector a has an intrinsic meaning: Consider a moving frame 
ei, 62 in 5^, with dual coframe 61, 02- Then Vy = ^161+^262 parametrizes TS^, and p^y = a-d'y = pi^i+P2^2 
parametrizes T*S'^, a = pi ei + p2 62- Here a • d'y , ^ 7jC?7i = denotes both an element of r*^^ and the 
canonical 1-form. Our parametrization for SO{3) is R{(j),^) = S{<j)) ■ -^(7), Rij) = rows(ei, 62, 7). Then 
L = p2ei — Pi 62 + ^3 7 corresponds to i = {p2, — Pi, ^3) along the section (/) = 0. The right invariant forms 
are compactly represented as 

P3 = # - (dei,62) , pi + ip2 = -iexp{i(l)){ei+ie2) . (1.9) 

Lifting V G TS'^ to an horizontal vector in TSO{3) is simple: 

^hor = [{vi ei + V2 62) X 7] = [v2 61 - vi 62] or hor{v) = V2X{- vi X^ , (1-10) 

Hence any vector R G TSO{3) can be written as R = uji Xf + uj2 X2 + u;3 X| with uJi = V2,oj2 = — f 1. Any 
covector pji G T*S0{3) can be written as p/j = pi 7r*{9i) + P2 tt*{02) + ^3 Ps- 

The reduced symplectic manifold {is) / = T* S"^ can be explicitly constructed, taking the section 
= 0. Let « : T*S2 ^ r*50(3), 

i{l^Pi^P2) = {R{l),(), i={P2,-pi,h) . (1.11) 

^We will drop the [•] and • in the sequel, and mix all the notations, hoping no confusion will arise. Equation (1.6) is one 
half of every system of ODEs for S^-equivariant mechanics in 50(3). Of course, we also obtain 7 — — x 7 by differentiating 
R-y — k (we could use the notation ^ — K, but we won't). 
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Then from (1.9) we get i* p2 = —6i , i* pi = 02, and i*d = di* yields 



i* dpi = d62 , i* 



dp2 



d9i , i* dps = i*pi A i*p2 



— —62O1 — 61 62- 



We get immediately 



d{pi 61+P292) + h area = Q!^I'g2 + h axesis^ . 



(1.12) 



All references to the moving frame disappear, but the expression ^^^^2 = d{pi9i +P2 02), suggests that 
whenever a natural mechanical system in T*SO{3) reduces to T*5^ = TS'^, there is a prefered choice for the 
moving frame {ei, e2}'y'- namely, that one which diagonalizes the Legendre transform T* S"^ = S'^ 

of the reduced (Routh) Lagrangian. 

1.2 Nonholonomic systems 

A NH system (Q, L, 7i) consists of a configuration space Q", a Lagrangian L : TQ x ]R ^ JR , and a totally 
nonholonomic constraint distribution 7i C TQ. The dynamics are governed by Lagrange-d'Alembert's 
principle^. Usually L is natural, L = T —V where Tis the kinetic energy associated to a Riemannian metric 
( , ), and V = V{q) is a potential. By totally nonholonomic we mean that the filtration TC C TCi C 7^2 C ... 
ends in TQ. Each sub-bundle Tii+i is obtained from the previous one by adding to 7ii combinations of 
all possible Lie brackets of vectorfields in TCi. To avoid interesting complications we assume that all have 
constant rank. Equivalently, let 7i° C T*Q the co-distribution of "admissible constraints" anihilating TC; 
dually, one has a decreasing filtration of derived ideals ending in zero. 

Internal symmetries of NH systems: Noether's theorem. An internal symmetry occurs whenever 
a vectorfield £ TC preserves the Lagrangian. For natural systems is a Killing vectorfield for the metric. 
Noether's theorem from unconstrained mechanics remains true. The argument (cf. Arnold, Kozlov, and Neishtadt 
[1988]) goes as follows: denote by (p^is) the 1-parameter group generated by ^ and let (/'(s,t) = (j)^{s) ■ q{t), 
so 4>' = -^(j) = ^,Q{(t>)- where q{t) is chosen as a trajectory of the nonholonomic system. Differentiating 
with respect to s the identitly L{(l){s,t), ■^(f>{s,t)) = const., after a standard integration by parts we get 
■^{^(p') = [T](l)' ■ This vanishes precisely when (j)' = £,q £ TC so := ^ ■ ^ = const. 

External symmetries: G-Chaplygin systems. External (or transversal) symmetries occur when group 
G acts on Q, preserving the Lagrangian and the distribution TC, this meaning that g^:TCq = TCgq. In the 
most favorable case one has a principal bundle action ^ — > S*™, m + r = n, where TC forms the 
horizontal spaces of a connection with 1-form (f) : TQ LieiG). These systems are called G — G haply gin^ . 

^"Vakonomic" mechanics uses the same ingredients, but the dynamics are governed by the variational principle with con- 
straints, and produce diflerent equations, see e.g. Cortes, de Leon, de Diego and Martinez [2003]. The equations coincide if 
and only if the distribution is integrable. In spite of many similarities, there are striking differences between NH and holonomic 
systems. For instance, NH systems do not have (in general) a smooth invariant measure. Necessary and sufficient conditions 
for the existence of the invariant measure were first given (explicitly in coordinates) by Blackall [1941]. 

*A "historical" remark (by JK). Chaplygin considered the abelian case. During a post-doctoral year in Berkeley, way 
back in 1982, I became interested in NH systems with symmetries. Alan directed me to two wonderful books: Hertz [1899] 
Foundation of Mechanics and Neimark and Fufaev [1972]. In the latter I learned about (abelian) Chaplygin systems, presented 
in coordinates. I said to Alan that I would like to examine non-abelian group symmetries, and Alan immediately made a 
diagram on his blackboard, and told me: "well, then, the constraints are given by a connection on a pricipal bundle". This 
was the starting point of Koiller [1992]. 
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Terminology. Since Bates and Sniatycki [1993], andBloch, Krishnaprasad, Marsden and Murray [1996], 
several authors have called attention on these two types of symmetries. Reduction of internal symmetries 
was described already in Sniatycki [1998]. To stress the difference, reduction of external symmetries is called 
here compression. The word reduction will be used for internal symmetries. 

LR systems. Veselov and Veselova [1986], Veselov and Veselova [1988] considered Lie groups Q = G 
with left invariant metrics, with constraint distributions given by right translation of P C Lie{G), i.e., 
the constraints are given by right invariant forms. For a LR-Chaplygin system, in addition there is a 
decomposition Lie{G) = Lie{H) © D, where H \s a, Lie subgroup such that Ad^-iD = h^^ D h = D. 
Therefore H ^ G ^ S = G/H is a //-Chaplygin system; the base S is the homogeneous space of cosets 
Hg. Fedorov and Jovanovic [2003] considered the case where G is compact and that Lie{H) is orthogonal 
to D with respect to the bi-invariant metric^. 

Compression of G-Chaplygin systems. From symmetry, it is clear that the Lagrange-D'Alembert 
equations compress to the base TS^^. In covariant form, the dynamics takes the form [L"^] = F(s,s), 
where L'f'{s,s) = L{s,h{s)) is the compressed Lagrangian in TS; h{s) is the horizontal lift to any local 
section and -F is a pseudo-gyroscopic force^^. In order to write F explicitly, take group-quasicoordinates 
{s,s,g,7r). Write q = gcr{s), with g £ G and a local section a{s) Q ^ S. Fix a basis X^. for the 
Lie algebra, [Xft:,-'^L] = ^kl-^J^ ■^{'^) — X^^/^/- Any tangent vector q G T^-^^^Q can be written as 
q = da{s) ■ s + ^((vr)) • cr(s). Horizontal vectors are represented by vr = b{s) ■ s, where b{s) is an r x m 
matrix. The connection 1-form writes as (j){q) = tt — b{s) ■ s. Then 

- . - - E ( t ) • E - If + E^ eg.) , . (.13, 

1.3 Main results 

Using the moving frames method we present results on two aspects of nonholonomic systems. 

• Cartan's equivalence, using Cartan's geometric description of NH systems via affine connections 
(Cartan [1928]). The objective is to find all local invariants. 

• Chaplygin systems: compression of external symmetries, reduction of internal symmetries. The ob- 
jective is to generalize Chaplygin's "reducing factor" method (Chaplygin [1911]), namely, verify if 
Hamiltonization is possible (via conformally symplectic structures). 

^These conditions are not met in the marble and rubber Chaplygin spheres, see section 3.2; however, the Veselov's result 
(theorem 3.3 below) on invariant volume forms still holds. 

^"The full dynamics can be reconstructed from the compressed solutions, horizontal lifting the trajectories via <j), since the 
admissible paths are horizontal relative to the connection. This last step is not "just" a quadrature; in the non-abelian case, a 
pat/i-ordered integral is in order. For G = 50(3), see Levi [1996] found an interesting geometric construction. 

^^This nonholonomic force represents, philosophically, a conceiled force in the sense of Hertz [1899], having a geometric origin. 
This force vanishes in some special cases, not necessarily requiring the constraints being holonomic. Equivalently, the dynamics 
in TS is the geodesic spray of a modified affine connection. One adds to the induced Levi-Civita connection in TS a certain 
tensor B{X,Y). This NH connection in general is non-metric (Koiller [1992]). 
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Results on Cartan's equivalence. In section 2 we analyze NH systems under the affine connection 
perspective. We pursue the (local) classification programme proposed by Cartan [1928] using his equivalence 
method. See Koiller, Rodrigues and Pitanga [2001] and Tavares [2002], for a rewrite of Cartan's paper in 
modern language. Cartan's method of equivalence is a powerful method for uncovering and interpreting 
all differential invariants and symmetries in a given geometric structure. In Ehlers [2002] NH systems in 
a 3-manifold with a contact distribution were classified. Here we go one step further, looking at Engel's 
distribution in 4-manifolds (see definition below). Our results are summarized in Theorem 2.3. The "role 
model" here is the rolling penny example (no pun intended) . This is the first such study for a distribution 
that is not strongly nonholonomic. Next in line is studying the famous Cartan's 2-3-5 distribution. 

Results on G-Chaplygin systems. Instead of using (1.13) in TS, we may describe the compressed 
system in T*S almost Hamiltonian system 

ix^NH = dH, H = H^:T*S^1R, nNH = Crf + (J-K) , (1.14) 

where H'^ is the Legendre transform of the compressed Lagrangian. (J.K) is a semi-basic 2-form on T*S 
which in general is not closed. As one may guess, J is the momentum map, and K is the curvature of the 
connection. Ambiguities cancel, since J is A(i*-equivariant while K is ^d-equivariant. The construction is 
independent of the point q on the fiber over s. 

Under an s G 5 dependent time reparametrization, dr = f(s) dt , several interesting compressed G- 
Chaplygin systems become Hamiltonian. A necessary condition is the existence of an invariant volume 
(Theorem 3.3), whose density F produces a candidate / = , m = dim(S') for conformal factor. 

Chaplygin's "rubber" ball (vertical rotations forbidden) is, as far we know, a new example, and generalizes 
the well known Veselova system in SO{2>) (Proposition 3.6). We describe the obstruction to Hamiltonization 
as the 2-form ix d{f^MH) (Theorem 3.4) and we discuss further reduction by internal symmetries. An 
example of the latter situation is Chaplygin's "marble" (a hard ball with unequal inertia coefficients rolling 
without slipping on the plane). It is non Hamiltonizable in T*S'0(3), and our calculations suggest that it is 
also non-Hamiltonizable when reduced to T* S'^ (heorem 3.8). Compare with Borisov and Mamaev [2001]. 

What does Hamiltonization accomplish. Why we focus so much on the question of Hamiltonizability? 
The example of the reduced equations for Chaplygins skate (after a 2- dimensional euclidean symmetry is 
removed) shows that changing time scale in a nonholonomic systems can completely change its character. 
In this example (see e.g. Koiller [1992]) the fully reduced equations of motion are not Hamiltonian because 
every solution is asymptotic in forward and backward time to a point, which depends on which solution 
you choose. However, after rescaling time the fully reduced equations become Hamiltonian, namely, the 
harmonic oscillator. However, this Hamiltonian vector field is incomplete because along one of the coordinate 
axes the time rescaling is not defined^^. In light of this example, why is time rescaling interesting? The 
answer is that it is interesting mostly in the context of integrability, where no singularities are removed in 
the phase space. See section 3. 

^^For details, see Koiller, Rios and Ehlers [2002], Koiller and Rios [2001]. The Hamiltonian compression for Chaplygin 
systems was first explored, in the abelian case, by Stanchenko [1985]. The non-closed term was described as a semi-basic 
2-form, depending linearly on the fiber coordinate in r*^, but its geometric content was not indicated there. 

^^We thank one of the referees for this observation. 
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2 Nonholonomic geometry: Cartan equivalence 

A Cartan nonholonomic structure is a triple {Q,G = ( • , • ),'H) where Q is an n-dimensional manifold 
endowed with a Riemannian metric G and a rank r, totally nonholonomic distribution 7i. Our motivation 
for studying such a structure is a free particle moving in Q, nonholonomically constrained to 7i, with kinetic 
energy T = ^( • , • ). The nonholonomic geodesic equations are obtained by computing accelerations using 
the Levi-Civita connection associated with G and orthogonally projecting the result onto 7i. The projected 
connection is called a nonholonomic connection (Lewis [1998]), and was introduced by Cartan [1928]. A 
distribution 7i is strongly nonholonomic if any basis of vectorfields spanning Ti onU C Q, together with their 
Lie brackets, span the entire tangent space over U. The equivalence problem for nonholonomic geometry 
was revisited in Koiller, Rodrigues and Pitanga [2001] and the generalization to arbitrary nonholonomic 
distributions was discussed. Engel manifolds provide the simplest example involving distributions that are 
not strongly nonholonomic"^^ . 

The main question we address is the following. Given two nonholonomic structures {Q,Q,7i) and 
{Q,G,'H), is there a (local) diffeomorphism f : U C Q ^ U C Q carrying nonholonomic geodesies in Q 
to nonholonomic geodesies in Q? In Cartan's approach, this question is recast as an equivalence problem. 
The nonholonomic structure is encoded into a subbundle of the frame bundle over Q, called a G-structure. 
The diffeomorphism f exists if the two corresponding G-structures are locally equivalent. Necessary and 
sufficient conditions for the G-structures to be equivalent are given in terms of differential invariants found 
using the method of equivalence. 

Outline. Our main example is the equivalence problem for nonholonomic geometry on an Engel manifold. 
Let Q be a four-dimensional manifold and TC he a rank two distribution. Ti. is an Engel distribution if and 
only if, for any vectorfields X and Y locally spanning 7i, and some functions a,b : Q ^ M, the vectorfields 
X , Y , Z = [X, Y] , and W = a[X, Z] + b[Y, Z] form a local basis for TQ. By an Engel manifold, we mean a 
four-dimensional manifold endowed with an Engel distribution. We begin by describing the nonholonomic 
geodesic equations. In the spirit of Cartan's program, we express them in terms of connection one-forms 
and (co)frames adapted to the distribution. This formulation is particularly well suited to the problem at 
hand; the nonholonomic geodesic equations are obtained by writing the ordinary geodesic equations in terms 
of the Levi-Civita connection one-form and crossing out terms corresponding to directions complementary 
to Tl. We then set up the equivalence problem for nonholonomic geometry and give a brief description 
of the equivalence method as it is applied to our main example. We conclude this section by applying 
the method of equivalence to the case of nonholonomic geometry on an Engel manifold. We derive all 
differential invariants associated with the nonholonomic structure and show that the symmetry group of 
such a structure has dimension at most four. 

^''Historical remarks. Cartan [1928] introduced the equivalence problem for nonholonomic geometry and studied the case of 
manifolds endowed with strongly nonholonomic distributions. In his address, Cartan warned against attempts to study other 
cases because of the "plus compliques" computations involved. In the meantime strides have been made in the equivalence 
method by Robert Gardner and his students that allow computations to be made at the Lie algebra level rather than at the 
group level (Gardner [1989]). This together with symbolic computation packages such as Mathematical make equivalence 
problems tractable in many important cases. See Gardner [1989], Bryant [1994], Montgomery [2002], Grossman [2000], Ehlers 
[2002], Hughen [1995], and Moseley [2001] for some recent applications. 
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2.1 Nonholonomic geodesies: straightest paths 

Totally nonholonomic distributions. A distribution 7^ is a rank r vector subbundle of the tangent 
bundle T{Q) over Q. Let = TC + [H^TC] and = [7i,W], and consider the filtration 

n cn^ c ■ ■ -n' c ■ ■ ■ cTQ. 

H is totally nonholonomic if and only if, for some k, Ti'^ = TQ at all points in Q. For the present discussion 
we will assume that rank of each T-C have constant rank over Q. As a specific example, consider the Engel 
distribution Ti on IR* with coordinates {x,y,z,w), spanned by {Xi = -^,X2 = ^ + w-^ + y-§^}- There 
are, in fact, local coordinates on any Engel manifold so that the distribution is given by this normal form, 
see Montgomery [2002]. Then {Xi, X2, = [Xi,X2]} spans the three-dimensional distribution Ti^, and 
{Xi,X2,Xs,X4 = [X2,Xs]} spans the entire TM'^. 

A path c : M ^ Q is horizontal if c{t) € 'Hc(t) ^or all t. Chow's theorem implies that if 7i is totally 
nonholonomic then any two points in Q can be joined by a horizontal path (see Montgomery [2002]). At 
the other extreme, the classical theorem of Frobenius implies that TC is integrable, which is to say that Q 
is foliated by submanifolds whose tangent spaces coincide with 7i at each point, if and only if [XjjXj] G TC 
for all i and j (Warner [1971]). 

In what follows we will need a description of distributions in terms of differential ideals. Details can 
be found in Warner [1971] or Montgomery [2002]. Let I = Ti^ be the ideal in h*{Q) consisting of the 
differential forms annihilating 7i. If W is rank r, then I is generated by n — r independent one- forms. The 
first derived ideal of T is the ideal 

{!)' := {e (^1 \ de = Q mod (J)}. (2.1) 
If we set I^^^ = X and T^""'"^) = (T^"))' we obtain a decreasing filtration 

J = d D • • • D 0. 

The filtration terminating with the ideal is equivalent to the assumption that the distribution is completely 
nonholonomic. We note that I^^^ = (TC^)'^ for j = 1, but this is not true in general for j > 1 (see 
Montgomery [2002]). At the other extreme, the differential ideal version of the Frobenius theorem implies 
that H is integrable if and only if (J)' C X (Warner [1971]). 

For the Engel example, the one forms 7]^ = dy — wdx and rf = dz — ydx generate the ideal X. Notice 
that dif' = r]^ A dx so 7]'^ E I^^^ but dij^ cannot be written in terms of rj^ or 77^ therefore rj^ ^ 

The nonholonomic geodesic equations. There are two different geometries commonly defined on a 
nonholonomic structure {Q,G = ( • , • ),'H): subriemannian geometry and nonholonomic geometry. In 
subriemannian geometry one is interested in shortest paths. The length of a path c : [a,b] — > Q joining 
points X and y is i{c) = J \J (c, c)dt. The distance from x to j/ is d{x^ y) = inf(£(c)) taken over all horizontal 
paths joining x to y. In nonholonomic geometry one is interested in straightest paths, which are solutions to 
the nonholonomic geodesic equations. Hertz [1899] was the first to notice that shortest / straightest unless 
the constraints are holonomic^^. 

The nonholonomic geodesic equations are obtained by computing the acceleration of a horizontal path 
c : M ^ Q using the Levi-Civita connection associated with Q and orthogonally projecting the result onto 

^^The terminology straightest path for a nonholonomic geodesic was in fact coined by Hertz himself. 
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Ti. It is convenient to adopt the following indicial conventions: 



1 < I,J,K <n 
1 !^ (= rank('H)) 

r + 1 < u < n. 



(2.2) 



Let e = {ej} be a local orthonormal frame for which the Cj span TC, and let = {tji} be the dual coframe 
defined by ?//(ej) = J/j, the Kronecker delta function. We note that the r]'^ annihilate TC and the metric, 
restricted to Ti is g\f^ = rf- "rf- A- • • • -\- rf ® rf . The Levi-Civita connection can be expressed in terms of 
local one-forms w/j = —ujji satisfying Cartan's structure equation dr] = —oj A rj (Hicks [1965]). 

A horizontal path c : — > M is a nonholonomic geodesic if it satisfies the nonholonomic geodesic 
equations 



j 



0, 



(2.3) 



where I < i,j,< r and Vi = rf[c) are the quasivelocities. 



Example: the vertical rolling penny. A standard example of a mechanical system modeled by a 
nonholonomic Engel system is that of a coin rolling without sliping on the euclidean plane. Consider a coin 
of radius a rolling vertically on the xy-plane. The location of the coin is represented by the coordinates 
{x,y,9^4)). The point of contact of the coin with the plane is (x,y), the angle made by the coin with 
respect to the positive x-axis is 0, and the angle made by the point of contact, the center of the coin, and 
a point marked on the outer edge of the coin is (j). The state space can be identified with the Lie group 
SE{2) X 50(2) where the first factor is the group of Euclidean motions locally parametrized by x, y and 9. 
The mass of the coin is m, the moment of inertia in the 9 direction is J and the moment of inertia in the 
(j) direction is /. The kinetic energy, which defines a Riemannian metric on the state space, is 

TTt J I 

T = —{dx (g> dx + dy (g> dy) + -d9 (g> d9 + -dcj) ® d(j). (2.4) 
The penny rolls without slipping giving rise to the constraints 

x = (acos^)<^ , y = {a sin 9) <f> . (2-5) 
Consider the orthonormal frame (Xi, X2, X3, X4) where 

X3 := ./^f-sin^l- + COS0I-') , X4 := fcos^^- + sin^l-V 
^ y m \ ox ay J ^ ^ ymy ax ay J 

Note that the constraint subspace 7i = span{Xi,X2}, and 7i^^^ = spanjXi, X2, X3}. The dual coframe is 
[r]^ , r]'^ , r]^ , 7]'^) where 

^1 ._ ^rnc?±l^^ ^ ^2 ._ (2.7) 

rf := \f^{— sin 9dx + cos 9dy) , Vj^ := y^(cos 9dx + sin 9dy — dcj)). 
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To compute the Levi Civita connection form we determine uj = [w/j] such that ujjj 
dr] = —uj A rj. Using simple linear algebra we find 



-llIjj and 



UJ 







J_ / m 3 J_ / m 2 



\ 



J_ / m „3 



1 / m 2 1 / Tn 1 

72 V J(ma^+I) V J(ma^+/) ^ 



V2 V Jima^+I) 



(2i 



so in particular 



U)i2 — —t02l — - 



m 



2 V J(ma2 + /) 



Let c : JR — > Q be a nonholonomic geodesic given by c(t) = + V2{t)X2- Prom the structure 

equations we see immediately that L0i2{c{t)) = —uj2i{c{t)) = and the nonholonomic geodesic equations 
reduce to -^{vi) = ^{^2) = 0. The nonholonomic geodesies are solutions to {x,y,<p,9) = AXi + 3X2- In 
particular, 

V2Aacos9(t) . V2Aasm0(t) ■ V2A ■ ^23 

x = — — , y = — — , cp = , 6 = — -j^. (2.9) 

The trajectories are spinning in place (A = 0), rolling along a line {B = Q), or circles {A, B ^ 0). 



2.2 Equivalence problem of nonholonomic geometry 

Cartan's method of equivalence starts by encoding a geometric structure in terms of a subbundle of the 
coframe bundle called a G-structure. We begin this section by describing the G-structure for nonholonomic 
geometry^^. We then give a brief outline of some of the main ideas behind the method of equivalence as 
it is applied in our example of nonholonomic geometry on an Engel manifold. Details on the method of 
equivalence can be found in Gardner [1989], Montgomery [2002], or Bryant [1994]. We then derive the local 
invariants associated with a nonholonomic structure on a 4-dimensional manifold endowed with an Engel 
distribution. 



Initial G-structure for nonholonomic geometry. A coframe r]{x) at x E is a basis for the cotangent 
space T*{Q). Alternatively, we can regard a coframe as a linear isomorphism r][x) : Tx{Q) — > iR" where 
iR" is represented by column vectors. A coframe can then be multiplied by a matrix on the left in the 
usual way. The set of all coframes at x is denoted F*{Q) and has the projection mapping vr : F*{Q) 1— > x. 
The coframe bundle F*{Q) is the union of the F*{Q) as x varies over Q. A coframe is a smooth (local) 
section rj : Q F*{Q) and is represented by a column vector of one-forms (rj^ , ...,7]^)^^ , where "tr" 
indicates transpose. F*{Q) is a right G/(n)-bundle with action Rgi] = g^^rj where g is a matrix in Gl{n). 
Let G be a matrix subgroup of Gl{n). A G-structure is a G-subbundle of F*{Q). We now describe the G- 

^^This G-structure was first presented by Cartan in his 1928 address to the International Congress of Mathematicians (Cartan 
[1928]) 
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structure encoding the nonholonomic geometry associated with a nonholonomic structure {Q,G,7i). Given 
a nonholonomic structure {Q, Q, Ti.) we can choose an orthonormal coframe t] = {rf ^ rf)^"^ on [/ C Q so that 
the Vj'^ annihilate 7i and use this coframe to write down the nonholonomic geodesic equations as described 
above. On the other hand, given a coframe r) = {rj\ f]'^)^^ on Q we can construct a nonholonomic structure 
iQ,G = ^^2^^ ® ~^ ® where Ti is annihilated by the f]'^ . How is f] related to r] if it is to lead to 

the same nonholonomic geodesic equations as ij? In order to preserve Ti. we must have ij'^ — 7]"^ = (mod 
/). In matrix notation, any modified coframe f] must be related to ij by 

j)Ko :)(::)^ 

where A € Gl{r), a G Gl{n — r), and b G M{k,n — r). If we were studying the geometry of distributions 
there would be no further restrictions. In order to preserve the metric restricted to Ti, we must further 
insist that A G 0{r). We would then have the starting point for the study of subriemannian geometry (see 
Montgomery [2002], Hughen [1995], or Moseley [2001]). 

It is important to observe that that in nonholonomic geometry we need the full metric and not just its 
restriction to Ti (as in subriemannian geometry) to obtain the equations of motion. Cartan [1928] showed 
that in order to preserve the nonholonomic geodesic equations, we can only add covectors that are in the 
first derived ideal to the rf. 

Since this fact is central to our analysis, we sketch the argument here (see Koiller, Rodrigues and Pitanga 
[2001] for details). Suppose f) = grj with connection one- form defined by df] = —uiAi]. For simplicity assume 
that A = id, then rf = fj^ (mod I) . The geodesic equations are preserved if and only if ujij (T) = Coij (T) 
for all T £ TC, in other words iVij = Oij (mod 2). Note also that ij'^ = (mod 2). Subtracting the structure 
equations for dr/* and dfj"^ we get 

dry* — df/* = —iOij Ar]^ — oJiy Ar]'^ + uJij Af)^ -\- uJiy Af]'^ = (mod 2). 

Now ff = rf + bii,rf so we also have 

drj^ - dff = drf - {dif + dbiyri" + biyd-rf) = -hyd-rf (mod J) 

Therefore biydrj'^ = (mod 2) or equivalently bi^ri'^ G This completes the argument. 
We further subdivide our indicial notation: let 

r+l<(/)<s (= rank H^) , s + 1 < ^> < n. 

Adapted coframes. A covector rj = (r/*, 77''^, r/*)^^) arranged so that 

1. The rj'^ and 77* generate /, 

2. ds^\n = Er]'^v\ 

3. The ry* generate the first derived ideal 
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is said to be adapted to the nonholonomic structure. In matrix notation, the most general change of coframes 
that preserves the nonholonomic geodesic equations is of the form f) = grj where 

/ ^ h \ 

g=\ ai a2 (2.11) 
V ag / 

with A S 0(/c), h S M(n — s, k), oi G Gl{s — k), 02 G — s, s — /c), and 03 G G/(n — s). The set of all 
such block matrices form a matrix subgroup of Gl{n) which we shall denote Go- 

The initial G-structure for nonholonomic geometry on {Q,ds'^,7{) is a subbundle Bq{Q) C F*{Q) (or 
simply Bq if there is no risk of confusion) with structure group Gq defined above. All local sections of Bq{Q) 
lead to the same nonholonomic geodesic equations. In this way, the initial G-structure Bq{Q) completely 
characterizes the nonholonomic geometry. 

Two G-structures, B{Q) -S Q and B{N) -5 A^, are said to be equivalent if there is a diffeomorphism 
f : Q ^ N for which fi{B{Q)) = B{N) where /i is the induced bundle map. (If we think of 6 G B[Q) as a 
linear isomorphism h : T^^(^i,-)Q — > JR^ then /i(6) = to (/*)~^ where /* is the differential of /.) Our original 
question of whether there is a local diffeomorphism that carries nonholonomic geodesies to nonholonomic 
geodesies can be answered by determining whether the associated G-structures are locally equivalent. 

2.3 A tutorial on the method of equivalence 

Necessary and sufficient conditions for the equivalence between G-structures are given in terms of differential 
invariants which are derived using the method of equivalence. In this section we briefly describe some of 
the main ideas behind the method of equivalence as it is applied in our example. Details and other facets of 
the method together with many examples can be found in the excellent text by Robert Gardner (Gardner 
[1989]). One of the principal objects used in the method of equivalence is the tautological one-form. Let 
B{Q) Q he a G-structure with structure group G whose Lie Algebra is Lie{G). The tautological one- 
form 0, on B(Q) is an iR^-valued one-form defined as follows. Let rj : U C Q — > B{Q) be a local section of 
B{Q) and consider the inverse trivialization U x Gq ^ B{Q) defined by {x,g) — > g^^r]{x). Relative to this 
section, the tautological one-form is defined by 

n{b) = g-H7r*r]) (2.12) 

where b = g^^r]. From (2.12) one can verify that the tautological one-form is semi-basic (i.e. Q{v) = 
for all V G fcer(7r*)), has the reproducing property 7^*17 = fj where fj is any local section of B{Q), and is 
equivariant: Rg^ = g~^^. The components of the tautological one-form provide a partial coframing for 
B{Q) and form a basis for the semi-basic forms on B{Q). 

The following proposition reduces the problem of finding an equivalence between G-structures to finding 
a smooth map that preserves the tautological one- form. (See Gardner [1989] or Bryant [1994] for a proof.) 

Proposition 2.1. Let B[Q) and B{N) be two G-structures with corresponding tautological one-forms Qq 
and Oat, and let F : B{Q) B{N) be a smooth map. If G is a connected and F*{0,i\i) = Qq then there 
exists a local diffeomorphism f : Q ^ N for which F = fi, i.e. the two G-structures are equivalent. 

To find the map F in this proposition we would like to apply Cartan's technique of the graph (cf. Warner 
[1971] p. 75): if we could find an integral manifold S C B(Q) x B(N) of the one-form 6 = — that 
projects diffeomorphically onto each factor, then S would be the graph of a function h : Q ^ N for which 
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hl^N = ^Q- By the above proposition the G-structures would then be equivalent. We generally cannot 
apply this idea directly because and Qjy do not provide full coframes on B{Q) and B{N) as is required 
in the technique of the graph. In the example of nonholonomic geometry on Engel manifolds, and indeed in 
many important examples (see Gardner [1989], Hughen [1995], Moseley [2001], Montgomery [2002], Ehlers 
[2002]), application of the method of equivalence leads to a new G-structure called an e-structure. An 
e-structure is a G-structure endowed with a canonical coframe. 

Differentiating both sides of (2.12) one can verify that dQ satisfies the structure equation 



dn 



(2.13) 



where T is a semi-basic two-form on B{Q) and a is a called a pseudoconnection: a Lie(G)-valued one-form 
on B[Q) that agrees with the Mauer-Cartan form on vertical vectorfields. Here, Lie{G) is the Lie Algebra 
of G. Summarizing, 



Pseudoconnection 



a 



g ^dg + semibasic Lie(G)— valued one form. 



(2.14) 



The components of the pseudoconnection together with the tautological one-form do provide a full 
coframe on the G-structure, but unlike the tautological one-form, the pseudoconnection is not canonically 
defined. Understanding how changes in the pseudoconnection affect the torsion is at the heart of the method 
of equivalence. 

For any G-structure, that part of the torsion that is left unchanged under all possible changes of 
pseudoconnection is known as the intrinsic torsion. The intrinsic torsion is the only first order differential 
invariant of the G-structure (Gardner [1989]). As an example, the intrinsic torsion for the G-structure B 
of a general distribution (equation 2.10) is the dual curvature of the distribution (Cartan [1910], see also 
Montgomery [2002]). In the case of a rank two distribution on a four dimensional manifold, the structure 
equations for the tautological one-form $7 are 



V y 



/ ^11 A12 
A22 

V 



/?13 
/323 
"33 
"34 



/?14 
/?24 
"34 
044 / 



A 



/ 01 \ 




( \ 

rp2 


+ 













(2.15) 



where 



and T^2^^ ^ Note that the distribution is integrable if and only if T^^ 



M. The intrinsic torsion consists of the terms Tj^2^^ ^ 

0. 



12 



rp4 
^12 



Reduction and prolongation. There are two major steps in the equivalence method: prolongation and 
reduction (see Gardner [1989] or Montgomery [2002]). In the case of nonholonomic geometry on an Engel 
manifold a sequence of reductions lead to an e-structure. A brief outline of the reduction procedure is 
as follows. The first step involves writing out the structure equations for the tautological one- form $7. 
A semi-basic Lze(G)-valued one-form is added to the pseudoconnection to make the torsion as simple as 
possible. Gardner [1989] calls this step absorption of torsion. The action of G on the torsion is deduced 
by differentiating both sides of the identity R*g{S^) = (7~^^7. The action of G is used to simplify part of the 
torsion. The isotropy subgroup of that choice of simplified torsion is then the structure group of the reduced 
G-structure. In the case of nonholonomic geometry on an Engel manifold this procedure is repeated until 
an e-structure is obtained. 



2.4 The nonholonomic geometry of an Engel manifold. 



16 



Suppose that is the canonical coframing on the resulting a manifold B. The $7* form a basis for the 
one-forms on B so we can write 

dn^ = c^jkCI'^ a . (2.16) 

J<K 

Relationships between the Cj^ are found by differentiating this equation. The resulting torsion functions 
provide the "complete invariants" for the geometric structure (see Gardner [1989] p. 59, Bryant [1994] pp.9- 
10, or Cartan [2001]). 

Many important examples have integrable e-structures. An e-structure is integrable if the Cj^ are 
constant (Gardner [1989]). In this case we can apply the following result from Montgomery [2002]: 

Lemma 2.2. Let B he an n- dimensional manifold endowed with a coframing il. Then the (local) group G 
of diffeomorphisms of B that preserves this coframing is a finite- dimensional (local) Lie group of dimension 
at most n. The hound n is achieved if and only if the e-structure is integrahle. In this case the Cjj^ are the 
structure constants of G, G acts freely and transitively on B, and the coframe can be identified with the left 
invariant one- forms on G. 

The Jacobi identities are found by differentiating dQ^ = X]j<e' ^jk^^ ^ ■ Lie's third fundamental 
theorem then implies that we can, at least in principle, reconstruct the group G using the structure constants. 
In some circumstances one can also conclude that B itself is a Lie group (see Gardner [1989] p. 72). 



2.4 The nonholonomic geometry of an Engel manifold. 



The initial G structure for nonholonomic geometry on {QjGjTC} where 7i is an Engel distribution on a 
four-dimensional manifold M is the subbundle Bq C F*{Q) with structure group Gq consisting of matrices 
of the form 



(2.17) 



/ All 


A12 


BlA 


\ 




A22 


B2A 







a33 


034 




V 





044 


/ 



where A = [Ajj] G 0(2), 033044 7^ 0, and B14 and B24, are arbitrary. 

Let = {n^,n^, n^, J]^)*'' be the tautological one- form on Bq. The structure equations are 



/ ^1 \ 

^3 







-7 



7 



033 





/3l4 
/324 
"34 
"44 / 



A 



+ 



\ Tf^n^ A 



T^^n^ A 



T^^n'^An^ J 



(2.18) 



where we have chosen the pseudoconnection so that the remaining Tj^ are zero. Q."^ G /(I) so dJ]^ = mod 
(^3, Q^) and we must therefore have — 0- Also, 1^3 ^ /(i) gg d^l^ 7^ mod ($73, Jl"^) therefore the torsion 
function cannot equal zero. The pseudo-connection for this choice of torsion is not unique. We can, for 
instance, add arbitrary multiples of to the (3^ and 0^4. 

Following Cartan's prescription, we investigate the induced action of Go on the torsion. Let g £ Gq. 
To simplify notation, functions and forms pulled back by Rg will be indicated by a hat so, for instance. 
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R*n = n = Ci^, n^)^"" and = fj^-. we have 



# 
# 

det(a~^)(a44r2^ — 034^!^ 
det(a-i)(a33f^^ 



(2.19) 



) / 



To determine the induced action of Go on the torsion we differentiate both sides of the identity R*^l'^ 
d'^. For Q,"^ we compute 



033 AQ^ + a34 A + ff^n^ A 0^ 
det{A~^)ff2n^ AQ'^ (mod 



and 



det(a ^)(a44dO"^ — 034^0^) 
det(o-^)(a44rf2^^ AJl^) 



(mod 

(mod . 



The induced action of Gq on 9 is therefore 



12 



det{A) 

«33 



12 



(2.20) 



Since 7^ we can force it to equal 1 using the action of Gq. The stabihzer subgroup Gi for this choice 
of torsion consists of matrices of the form (2.17) with 033 = e where e = det(A). Note that Tj^2^^ ^ 
the (normahzed) dual curvature of the distribution. 
The structure equations for the Gi -structure Bi are 



^3 

J 



/ 7 /3i4 

-7 (32i 

034 

V 044 













A 

/ 




+ 

V 



23" 



Tfr^n^ An^ + r4rj2 ^ ^3 



(2.21) 



' 23-^ 



Let g £ Gi. We write the inverse of g as 



An 


A21 





Bu 


A12 


A22 





B2A 








033 


034 











«44 



(2.22) 



so in particular 033 = e, 034 = —6034(044) ^, and 044 = (044) ^. We have 



RIQ 



033 + 034 



V 



«44 



(2.23) 
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For the next reduction we differentiate both sides of the identity R*Q,'^ = Q,^. We have 

044 A + ff^n^ An^ + f^-^Cl^ A (mod 17^) 



. On the other hand 

dn^ = dudn^ = di4{Tf^n^ An^ + T^^n"^ a n^) (mod n'^) 

The induced action of Gi on the torsion plane {Tf^,T2^) is therefore 



5^3 
-^23 



044 



rp4 
m4 

-'23 



(2.24) 



The torsion plane (2^13, ^23) 7^ (0, 0) since I*-^^ = implies that dil'^AQ^ 7^ and we have already established 
that = 0- We can therefore use the action to force (r^3,r23) = (0, 1). The torsion T23^l^ A can be 
interpreted as the (normalized) dual curvature of the rank three distribution Ti.^. The statement that 
(T^^3,T23) 7^ (0,0) is equivalent to H not being integrable. To determine the subgroup that stabilizes this 
choice of torsion, we investigate 



J?* 



As ^ G 0(2) it must be of the form 



-A 



-1 



«44 



€i€2 
€2 



(2.25) 



(2.26) 



where ei, €2 G {—1, !}• We must also have 044 = €162 so that the stabilizer subgroup G2 consists of matrices 
of the form 

/ €162 Si4 \ 
£2 ^24 

ei 034 
V eie2 / 

where ei, €2 G {—1, 1} and B^, B24, and 034 G IR. We compute 



( \ 



( eie2^^ - Bun'^ 

£2^1^ — -624^^ 

eiO^ — e2a34r2^ 
V eie2f^' 



The structure equations are now 







( /3l4 A 


\ 




^^2 




/324 A 0^ 




+ 






034 A 




J 




^ 


) 


V 



Tl2^^ A + Tl: 



A 

Tf^^^ A + rfgS^l A -r ^23 



+ T2\i72 A 



(2.27) 



(2.28) 



T/^O^ A 0^ 



-j^giiL /Xii:, '-'■23 
+ ^2 A 03 + r2'402 A 0^ + r34403 A 0^ / 



(2.29) 
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B2 is not an e-structure so again we differentiate both sides of the identity Rg^ 
of G2 on the torsion. After some computation, we find that 



to determine the action 



mod ^t) 
eiB2in'An-' (mod Q^) 



BuQ.^ A n-^ 
•>2 A r>3 



Also, 



RlidQ') 
Rlidn'^) 

R*(dn^) 



e2flsn'n^ + 6162^2^3 



Matching the Jl^ A il'^ terms we find that 



Tr 



1 

23 

-'23 
-^23 



We can therefore use the action of Gi to force T23 = 
choice of torsion consists of matrices of the form 

/ eie2 






e27f3 - 

_ rp2 

— ^23 



eifi^af^^ A 
eiffaO^ A 
ein^ An^ 

eie2-Bi4, 

— £2-824, 

- £1034 ■ 



(mod 17^) 
(mod n^) 
mod n^). 



(2.30) 



T^3 

-^23 



V 



£2 









£1 









£i£2 / 



0. The stabihzer subg roup G final foi this 



(2.31) 



The reduced structure group is discrete so we now have an e-structure Bfi^ai- The tautological one-form 



{0,1,0.2,^3,94)^^ provides a full coframing for B final- The B final structure equations are 



/ Oi 

O2 
O3 
\ O^ 



12 

1 





^13 

-'13 

rr'3 
^13 



rp2 
T^3 

-^14 



^24 

^24 
7^3 
-^24 

-^24 



-'34 
-^34 

rp3 

-^34 
7^4 
-^34 



/ 9} AO? 
0} AO? 
0^ AO"^ 
0^ AO^ 
^2 AO^ 
\0^ AO^ J 



(2.32) 



where the T^j are functions on B final- What remains is to determine any second order relations between 

To determine these we use the fact that 



the torsion functions. To determine these we use the fact that d = 0. After some computation, we find 
that = + ^1^3 • We summarize these results in the following theorem: 

Theorem 2.3. Associated to any nonholonomic Engel structure {Q,Q = { ■ , ■ ),'H} there is a canonical 
G = Z2 X Zi2-structure B final- The tautological one-form (Qi, ^2, ^^3, ^^4)^^^ provides a canonical coframing 
for B final- The B final structure equations are 
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2.4 The nonholonomic geometry of an Engel manifold. 
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According to the framing lemma (lemma 2.2) the largest Lie group of symmetries of a nonholonomic 
structure on an Engel manifold is the dimension of B final which is four. In this case the Tj^ are constants 
and can be identified with the structure constants of the four-dimensional Lie algebra of the symmetry 
group. The Jacobi identities are obtained using the identity (P = 0. We have computed these, and it 
appears that the set of possible symmetry algebras form a rather complicated subvariety of the variety 
of all four-dimensional Lie algebras. We leave as an open problem the classification of all possible four- 
dimensional symmetry algebras for nonholonomic structures on an Engel manifold. 

The rolling penny (Continued). An example of a structure with maximal symmetry is given by the 
rolling penny. A iJ^j^a^-adapted coframe for the penny-table system is 



ma^ + 1 



= ^^de (2.34) 

3 Jj{ma^ -h I) , . ^ , ^ , , 
T] = (— smt^ax -|- cos f/ayj 

4 frn , 



The structure equations are 



ri = — {cosOdx + sin 



dr]^ = 

dr]"^ = (2.35) 



n 19/ rna^ + I n 

dif = V /\r] \ "n A V 
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The torsion functions are constant so by the framing lemma (lemma 2.2) we can identify these constants 
with the structure constants Lie group of symmetries of this system. We recognize them as the structure 
constants for the Lie algebra of the group SE{2) x S0{2) which is isomorphic to the configuration space of 
the penny-table system. 

i?jj„a;-adapted frames and coframes. The e-structure B final has a canonical coframing which descends 
to a coframing and hence a framing, up to signs, on Q. There should be a relationship between this 
framing and a canonical line field possessed by any Engel manifold. In this section we briefly describe this 
relationship. If Q and 7i are both oriented, then Q is parallelizable and the following constructions can be 
made globally (Montgomery [2002]). 

Let 7] he a B final adapted coframe on U CQ with dual frame X = {Xj} defined by v^{Xj) = 6ij. If 
f] is any other iJjj^a^-adapted coframe with dual frame Xon U, then, by theorem 2.3, fj is related to rj by 
77^ = eie2??^, f]^ = e2?7^, fj^ = (^irf, f]^ = eie2^'^- The dual frames are related in precisely the same way: 
Xi = £162X1, X2 = €2X2, X3 = [Xi,X2] = £1^3, and X4 = [X2,X3] = 6162^4. 

An important feature of an Engel distribution is the presence of a canonical line field L C TC (Montgomery 
[2002], Kazarian, Montgomery and Shapiro [1997]). L is defined by the condition that [L,Ti^] C Ti^. Here 
we are abusing notation, using L for the line field or a vectorfield spanning L. We have 
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Corollary 2.4. Let rj = rj^ be a B final -<idapted coframe. Let X = {X/} he the dual frame defined by 
ri^{Xj) = 5ij, then L = span{Xi). 

Proof. Suppose L is spanned by the vectorfield Y = aXi + 6X2. Since 7/^ annihilates Tl^ we have 
7?4([X3,y]) = 0. Then 

= r?4([X3,y]) = XsvHY) - YvHXs) - d7i\Xs,Y) = -dv^X^^Y). 
But dr]'^ = rf f\'rf' mod (77^) so we must have 

Q = rf ^r,\X,,,Y) = ri'{X^)rf{Y)-r,\XMy) 
= -v'{Xs)v2{Y) 
= -b. 



L is therefore spanned by Xi. This concludes the arguement. 

There is a natural metric, associated with B final, on Q given by gnat = 77^ (8> + • • -i?^ i8> where 
fj is any B final-adapted coframe. Clearly all S/ma^-adapted coframes induce this same metric; using the 
subriemannian metric gnatln we form L-*- within H so that H. = L@ L-*-. By construction, X2 spans 



3 Nonholonomic dynamics: Chaplygin Hamiltonization 

Historically, Hamiltonization of nonholonomic systems started with Chaplygin's last multiplier method. In 
the new time, the dynamics obeys Euler-Lagrange equations without extra terms] the gyroscopic force (1-13) 
"magically" disappers! When after a time reparametrization the compressed system can be described as a 
Hamiltonian system, symplectic techniques can be employed. A number of NH systems have been Hamil- 
tonized, and some interesting ones are Liouville-integrable, see Veselov and Vesclova [1988], Kozlov [2002], 
Fedorov and Jovanovic [2003], Fedorov [1989], Dragovic, Gajic and Jovanovic [1998], Borisov and Mamaev 
[2002a], Borisov and Mamaev [2002b], Borisov, Mamaev and Kilin [2002], Borisov and Mamaev [2001], Jovanovic 
[2003]. 

3.1 Compression to T*S, S = Q/G; existence of invariant measures 

We recall from the introduction that the compressed system has a concise almost Hamiltonian 

dH^ = ix^„^NH , ^NH ■■=^lan +{^-^) > dJ^TVH / (in general) , 

where ri^^ is the canonical 2-form of T*S and the (J.K) term is a semi-basic two form, which in general is 
non-closed. It combines the momentum J of the G-action on T*Q, and the curvature K of the connection. 
As this is important for the remaining, we outline the derivation (see Koiller, Rios and Ehlers [2002] for 
details). Given the coframe coordinates m, , e{q) in T*Q (see 1.5) the Poisson bracket matrix relative to 
€/ , dmi is 

= t^]"' = ( -1 \) M 

with 

EjK = midei{ej, ex) = -mi€i[ej, ex] ■ (3.2) 
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Let us consider the case of a principal bundle vr : with Lie group acting on the left, r = n — s. 

Recall our convention: capital roman letters I,J,K, etc., run from 1 to n. Lower case roman characters 
i,j,k run from 1 to s. Greek characters a, /3,7, etc., run from s + 1 to n. 

Fix a connection A = X{q) : TgQ — > Lie{G) defining a G-invariant distribution 7i of horizontal subspaces. 
Denote by K[q) = dAoHor : TqQ x TqQ Q the curvature 2-form (which is, as well known, Ad-equivariant). 
Choose a local frame Cj on S. For simplicity, we may assume that 

ei = d/dsi (3.3) 

are the coordinate vectorfields of a chart s : S ^ M^. 

Let Ci = h{ei) their horizontal lift to Q. We complete to a moving frame of Q with vertical vectors Ca 
which we will specify in a moment. The dual basis will be denoted ej, Eq, and we write pq = miei + rua^a- 
These are in a sense the "lesser moving" among all the moving frames adapted to this structure. We now 
describe how the n x n matrix E = (Eij) looks like in this setting. 

i) The s X s block (Eij). 

Decompose [ei,ej] = h[ei,ej] + V[ei,ej] = V[ei,ej] into vertical and horizontal parts. The choice (3.3) 
is convenient, since Cj and Cj commute: [ej,ej] is vertical. Hence 

Eij = -Pq[ei,ej] = -maea[ei,ej] . (3.4) 

Now by Cartan's rule, 

K{ei,ej) = eiX{ej) - ej\{ei) - A[ei,ej] = -\[ei,ej] G Q 

Thus we showed that 

[ci, ej\q = -K{ei, ej) ■ q (3.5) 
Moreover, let J : T*Q Lie{G)* the momentum mapping. We have 

{J{pq),Kq{ei,ej)) = pq{K{ei,ej).q) = -Pq[ei,ej] {= Eij) 

Theorem 3.1. (The J. K formula) 

E,j = {J{pq),Kq{e^,ej)) (3.6) 

This gives a nice description for this block, under the choice [ei,ej] = 0. Notice that the functions Eij 
depend on s and the components rua, but do not depend on g. This is because the Ad* -ambiguity of the 
momentum mapping J is cancelled by the Ad-ambiguity of the curvature K. The other blocks are not 
needed here, but we include for completeness. 

ii) The r x r block (Ea/s). 

Choose a basis Xa for Lie{G). We take ea{q) = Xa ■ q as the vertical distribution. Choosing a point qo 
allows identifying the Lie group G with the fiber containing Gqo, so that \A ^ qo- Through the mapping 
g £ G ^ gqo £ Gqo the vectorfied is identified to a right (not left!) invariant vectorfield in G. The 
commutation relations for the [ea,e^] = —cJ^gS-y appear with a minus sign. Therefore 



Eal3 = '^1^1(3 



(3.7) 
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iii) The s x n block {Eia). 

The vectors [cj, e^] are vertical, but their values depend on the specific principal bundle one is working 
with, and there are some noncanonical choices. Given a section a : Us ^ Q over the coordinate chart 
s : Us ^ iR™ on S, we need to know the coefficients b]^ in the expansion 

[ei,ea](cj(s)) = b]^{s)e^ . 

Then 

E,^{a{s)) = -m.,bUs) • (3.8) 
At another point on the fiber, we need the adjoint representation Adg : Lie{G) Lie{G), X i-^ g~^Xg, 
described by a matrix {A^a{g)) such that 

AdgiXo,) = Af,^ig)X^ . (3.9) 

Then 

[ei,ea]{g ■ cr{s)) = -m^b]^{s)A^a{g) • (3.10) 

The clockwise diagram. 

the fiber 7r^^(s) of Q over s 



TS < — T*S 
[Leg^r^ 

Taking differentials of all maps in (3.11) we obtain an induced principal connection (j) in the bundle G ^ 
Leg{Ti.) — > r*^. Let v,w,z £ Tp^{T*S), V,W,Z horizontal lifts at Pg E Leg(7i), and denote by K the 
curvature of this induced connection. The following proposition is basically a rephrasing of a result in 
Bates and Sniatycki [1993]. 

Proposition 3.2. 

d{J.K){v,w,z) = cydic{d J {V),K{W,Z)) . (3.12) 

Densities of invariant measures and a dimension dependent exponent. A necessary and sufficient 
condition for the existence of an invariant measure for compressed Chaplygin systems was obtained by 
Cantrijn, Cortes, de Leon, and de Diego [2002] (Theorem 7.5). Since in T*S there is a natural Liouville 
measure dvol = dsi...dsmdpi...dpm, where {s,p) are coordinates in T*S, the density function F produces an 
educated guess for a time reparametrization which may Hamiltonize the compressed system. If dim(5') = m 
and f^NH is closed, the time-reparametrized vectorfield Xnh/ f has the invariant measure /'"dvol. Xnh 
will have the invariant measure f^^~^ dsi...dsmdpi...dpm- Working backwards, if a measure density F is 
known so that F{s)dvo\ is an invariant measure for X^h, then the obvious candidate for conformal factor 
is 

/ = F(s)^. (3.13) 
This dimension dependent exponent will be relevant in the Chaplygin marble, see section 3.2. 



Starting on ps G T*S we go clockwise to Pq G Leg[Ti.) C T*Q, for some q on 

ncTQ — > Leg{n) C T*Q 

Leg 

T 

t (3.11) 
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Invariant measures for LR systems. Let Q = G a unimodular Lie group and identify TG = T*G 
via the bi-invariant metric. Assume that H C G is a subgroup acting on the left and preserving the 
distribution: V^g = hVg = h'Dg (which boils down to Ad^-i T) = h"^T>h = V). The Legendre transform 
Leg : Lie{G) Lie{G) = Lie*{G) of a natural, left invariant Lagrangian, is represented by a positive 
symmetric transformation A : Lie{G) Lie{G), the inertia operator. 

For each g £ G, let Pg and Pg be, respectively, the projections of Lie{G) relative to the decomposition 
Lie{G) = Adg-iLie(H) © Adg-iV. We can also think of Pg as a map Pg : TgG ^ V g, projection parallel 
to the vertical spaces Lie{H)g. Let Pg oLegg : Vg — > Vg. This map descends to the compressed Legendre 

transform Legf : Tg S ^ Tg S = T*S, where S = G/H is the homogeneous space whose metric is induced 
by the bi-invariant metric on G. Consider the function 

F{s) = detLegf . (3.14) 

The following result is a rephrasing of a theorem by Veselov and Veselova [1988], see also Fedorov and Jovanovic 
[2003] (Theorem 3.3)^^. 

Theorem 3.3. The reduced LR-Chaplygin system in the homogeneous space T*{G/H) always has the 
invariant measure 

u = F{s)-^/^dsi---dsmdpi---dpm , F{s) = det Legf . (3.15) 
The density can be also calculated by the "dual" formula 

Fis) = det(yl)det {P^oA~\-iLi,^H)g) (3.16) 
(Pg is the projection over g~^ Lie{H)g parallel to g~^T>g). 

The second formula may be easier to use if there are few constraints. 

Almost Hamiltonian systems. Let Q be a non-degenerate (but in general, non-closed) 2-form on M^", 
and H he a function on M. Denote (as usual) hy X = Xh the skew-gradient vectorfield defined by 
ix^ = dH. We say Xh is almost Hamiltonian. If a is a closed 1-form, the vectorfield X = X^ defined by 
ix^ = a is called locally almost Hamiltonian. Distilling a construction in Stanchenko [1985], we formalize 
an extension of the notion of a conformally symplectic structure. 

The 2-form is called H (or a)- affine symplectic if there is a function / > on M and a two form 
such that i) ix^o = ; ii) — is non-degenerate, and iii) Cl = f{^l — Oo) is closed^^. 

The first condition implies that X does not "see" ilo- Together with the third, we get Q{X/ f, •) = dH 
so the vectorfield X/f is (truly) Hamiltonian with respect to the symplectic form 0. 

The closedness condition can be restated as 



d{n - Qo) = {n-no) /\0 , where 9 = df / f . (3.17) 

When (3.17) holds with a a closed (but not necessarily exact) 1-form, we say that Q is locally affine 
symplectic. The following proposition describes the obstruction to Hamiltonization once / is given. 

^''We do not need to assume T> and Lie{H) to be orthogonal with respect to the bi-invariant metric. 

^*We must admit, however, that we found no example yet where the aiBne term is really needed. This notwithstanding, at 
any point where X 7^ 0, the contraction condition yields d — 2n equations on d{d — l)/2 unknowns (local coordinate coefficients 
of Q,o)- This allows additional freedom to Hamiltonize X rather than just requiring conformality of Q. 
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Theorem 3.4. Given a locally almost hamiltonian system (i^,a) and an educated guess / > 0, an affine 
term exists with d{fQ — Qq) = if and only if ix d{fQ) = 0. 

The proof is quite easy. The vectorfield X satisfies ix^ = ct. Since the same equation holds by replacing 
X by X/f and Q by ffl, to expedite notation we may assume / = 1. Let us prove that exists if ixdVt = 0. 
Since d{ix O) = da = 0, we see that the Lie derivative Lx^ = 0. Consider a regular point of X. By the 
flow box theorem there are coordinates so that X = d/dxi. Since Lx^ = 0, the coefficients of this 2- form 
do not depend on the coordinate xi (but there may exist terms with a dxi factor). However, our hypothesis 
iQ/Qxi d^l = ensures that there are no terms containing a dxi factor in dil. Thus dQ can be thought as a 
3-form in the space of the remaining coordinates. By Poincare's theorem dil. = dQo, where is a 2- form 
in the space of the remaining coordinates. Hence ix^o = and d{Q. — ilo) = 0, as desired. The converse is 
even easier. 

3.2 Examples: Veselova's system and Chaplygin spheres (marble or rubber) 

Veselov and Veselova [1986, 1988] considered one of the simplest nonholonomic LR-Chaplygin system, Q = 
50(3) with a left invariant metric L = T = ^(AO, O), and subjected to a right invariant constraint which, 
without loss of generality, can be assumed to be ps = 0. Hence the admissible motions satisfy = 0, where 
uj is the angular velocity viewed in the space frame. This is a LR Chaplygin system on ^ S0{3) — > S'^. 

Chaplygin's ball is a sphere of radius r and mass fi, whose center of mass is assumed to be at the 
geometric center, but the inertia matrix A = diag(Ii, 12,13) may have unequal entries. Thus its Lagrangian 
is given by 2L = [AQ, O) + fi{x'^ + + i^) • The configuration space is the euclidian group Q = SE{3). 

In the case of the marble, the ball rolls without slipping on a horizontal plane, with rotations about the z- 
axis allowed^^. Thus the distribution of admissible velocities is defined by 2? : z = , x = ruji , y = —ruj2 ■ 
Both Lagrangian and constraints are preserved under the action of the euclidian motions in the plane, 
together with the vertical translations. G = SE{2) x ]R acts on Q via 

{(]), u, V, w).{R, X, y, z) = {S{<j))R, e^'^{u + iv), z + w) . 

The dynamics could be be directly reduced to T>/G, see e.g. Zenkov and Bloch [2003], but we will proceed 
in two stages. First, we Chaplygin-compress the dynamics from TQ to TSO{'i) using the translation 
subgroup of «S'-E(3), regarding the constraint distribution as an abelian connection on Q with base space 
S = 80(3) and fiber JE?; the connection form is given by 

otmarble ■= {dx - rp2 , dy + rpi, dz) . (3.18) 

There is another action on Q, this time acting on the first factor only: e^'^{R,z) = {S{(j))R, z). This 
action preserves the Lagrangian but does not preserve the distribution: D{^s{<t>)R,z) / e**^ -^'(ij.z)- However, 
its infinitesimal action is given by the right vectorfield X^ G "D. Noether's theorem applies, so p^p = £3 
is a constant of motion. Therefore Chaplygin's marble equations can be reduced, on each level set ^3, to 
T{SO{3)/S^) =TS^. 

Chaplygin [2002] showed that the 3d problem is integrable using elliptic coordinates in the sphere; for n > 3 the problem 
is open. For basic informations, see Fedorov and Kozlov [1995], p. 147-149, on the 3-d case and p. 153-156 for the general 
n-dimensional case. For a detailed account on the algebraic integrability of "Chaplygin's Chaplygin sphere", see Duistermaat 
[2000]. Schneider [2002] analyzed control theoretical aspects. 
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In the case of Chaplygin' s rubber balf^, rotations about the vertical axis are forbidden (since such 
rotations would cause energy dissipation). Here the constraints are defined by a sub-distribution 7i C V 
with Cartan's 2-3-5 growth numbers and in fact defining a connection on SE{2) x M Q ^ with 
1-form 

arubber ■= (ps k , dx - rp2 , dy + rpi , dz) . (3.19) 



The extrinsic viewpoint For clarity we present the classical, direct derivation of the equations of motion, 
following the "extrinsic viewpoint" advocated by the Russian Geometric Mechanics school (Borisov and Mamaev 
[2002a]). 

• For the rubber Chaplygin ball (and Veselova's): in the space frame one has £ = t, where r = A/c is 
the torque exerted by the constraint force. The torque is vertical because (r, w) = for all to with third 
component equal to zero. Viewed in the body frame, 

L + nx L = Xj , (3.20) 

Together (1.6), one gets a closed system of ODEs in the space {L,j) G x M^, provided the relation 
between and L is obtained. In Veselova's example, = A~^L. The multiplier can be eliminated by 
differentiating the constraint equation (^^,7) = 0. After a simple computation, one gets 

A = . 3.21 

Besides the standard integrals of motion 2H = {A''^L,L), (7,7) = 1, {A~^L,^) = 0, Veselov and Veselova 
[1988] showed that there is a quartic polynomial integral 

G = (L,L)- (L,7)2 (3.22) 

and an invariant measure^^ 

p = f{7)dLi A dL2 A dLs A dji A d72 A ^73 , fil) = t)"^^^ • (3.23) 

• For Chaplygin's marble: the angular momentum at the contact point, in the space frame i is constant. 
An engineer would argue that both gravity and friction produce no torque at that point; a mathematician 
would use the fact that the admissible vectorfields Vi £ Ti. given by 

Vi := -r d/dy + X^'S^t ^ := r d/dx + X^'^ht ^ _ ^right ^3 

preserve the Lagrangian, and would invoke NH-Noether's theorem. Whichever explanation chosen, differ- 
entiating RL = i = RL and Rj = k one gets Chaplygin's equations 

L = -nx L , 7 = -Jlx7. (3.25) 



^"This problem was not studied by Chaplygin. For the physical justification, see Neimark and Fufaev [1972] and 
Cendra, Ibort, de Leon, de Diego [2004]. As far as we know its integrability has not yet been established. Formally, Veselova's 
system is the limit of Chaplygin's rubber ball as r — » 0. 

■^'^The level sets of the four integrals are 2-tori, since there are no fixed points in the dynamics. The existence of an invariant 
measure in the tori allows the explicit integration via Jacobi's theorem. Veselov and Veselova [1988] found a rather unexpected 
connection with Neumann's problem". 
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These two form a coupled system, since again Q. is a linear function of L depending only on 7: 

L = (n) = An + fir^-f X X 7) = - /zr2(7, 0)7 , A:=A + /xr^id (3.26) 
A simple way to get this map is to look at the total energy 

2T = (w, i) = {Q, L) = {An, n) + fiix"^ + 2/2) = {An, Q) + fir^{iol + uj^) (3.27) 
which can be also written as 

2T = (o, L) = {An, n) + nr'^{n , X {nx -f)) = {n, An + fir^-f x {nxj))) . (3.28) 

The expression 7 x (• x 7) represents the projection in the plane perpendicular to 7, and we get (3.26). An 
ansatz for the inverse of the map (3.26) is (Duistermaat [2000]), 

n = n{L,-f) = (L^)-i(L) = A~^L + a{L)A-^{j) (3.29) 

and one gets the interesting expression for a{L) (which will be used in equation (3.48) and Proposition 3.8): 

a{L) = fir ^— — . (3.30) 

1 — /ir^(7, A~^ 7j 

The function 

/(7):=[l-/ir2(7,i-i7)]-V2 (3.3I) 

was found by Chaplygin to be the density of an invariant measure in M^: 

^iR6 = /(7) d'yid'y2d'y3dLidL2dL3 (3.32) 

This follows from Veselov's theorem, as -^(7) = l — fj.r'^{'y, A~^j) is (up to a constant factor) the determinant 
of the linear map 1— > L = L(f];7). For direct proofs of invariance of the measure, see Duistermaat [2000] 
or Fedorov and Kozlov [1995]. 

A system of ODE's for the rubber ball can be derived in a similar fashion. For the angular momentum 
i at the contact point, we get the same equation (3.20) from Veselova's system, but the relation between 
n and L is (3.26), the same as in Chaplygin's marbe. Differentiating (^^,7) = the multiplier can be 
eliminated. 

Hamiltonization of Veselova's system. The compressed Lagrangian is 

icomp = ^(^(7X7),7X7) , (3.33) 

since = 7 x 7; the momentum map corresponding to the 5^-action is J = ^3 = (^,7). Thus (J.K) = 
^3 dps = {An,^) dp3, where dp^ is the area form of 5^. The compressed Legendre transform is 

7 1-^ a = — — = 7 X Ay^ X 7) . 
07 

The nonholonomic 2-form in T* S"^ is 

nNH = da A 0(7 + (^(7 X 7) , 7) dp3 (3.34) 
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Being a two-degrees of freedom system, a general result from Fedorov and Jovanovic [2003] (Theorem 3.5) 
garantees that this system is Hamiltonizable. In order to verify that ^nh is conformally symplectic, it is 
simpler to use 7 themselves as coordinates, that is, we pull back 0,]^^ to TS^ via Leg*. We get 

^NH = d{'y X A{j X 7))) A (i7 + (7, ^(7 x j))dp3 . 

Proposition 3.5. Veselova's system is conformally symplectic, d{filNH) = 0, with conformal factor 

/ = /(7) = (A-S,7)"'/'- (3.35) 

As expected, it is the density of the Veselovs invariant measure fi = f{'y)dLd'y obtained via Proposition 
3.3. The orthonormal frame in diagonalizing (3.33) provides explicit coordinates for integration via 
Hamilton-Jacobi's method. 

Chaplygin's rubber ball. The dynamics compress to T*S^, and by the same general result in Fedorov and Jovanov 
[2003], we know in advance that the system is Hamiltonizable. Choose a moving frame 61,62 in 5^. The 
horizontal lift from j = vi ei + V2 62 to Hor(7) G T{SE{3)) is easily done via (1.10): 

Hor(7) = V2 {X{ — rd/dy) — vi {X2 + rd/dx) 

Composing dambber = {pi A p2, — r/>3 A pi,rp2 A ps, 0) with Hor, we get Kmbber = (dSk, 0, 0, 0), where dS is 
the S'^ area form. Thus for the term (J.K) we need only the third component of the angular momentum, 
m3 = (M, 7) = (AQ, 7), where we insert (1.10) $7 = 7 x 7 = ^2 ei — 62- Therefore 

n^H = nT*s^ + (^(7 X 7), 7) • dS . (3.36) 

Here 7 = ei + f 2 62 G TS"^ corresponds to p^^ = pi9i + p2 62 via the Legendre map Leg'^°"^^ of the 
compressed Lagrangian 

Lcomp = 'i^A{v2 ei - vi 62, V2 ei - vi 62) + ^pr'^ivl + v^) . (3.37) 

Clearly, this system becomes Veselova's for r = 0. Using Proposition 3.3 and Fedorov's result for two 
degrees of system, we get: 

Proposition 3.6. The compressed rubber ball system is Hamiltonizable. The conformal factor is 

/ = [detW°-^]-V2 = (j,j,j3)-i/2 [iA-\l)+pA^^ + ^-^ + ^-^ . 

(3.38) 

Proof.^^ We checked using spherical coordinates and Mathematical. 

^^We can provide the (short) notebook under request. It should be investigated if the rubber baU problem is integrable. 
Does a (quartic) integral still exist? 
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The homogeneous sphere, hi a nutshell: the dynamics in the homogeneous case is embarrassingly 
simple. The angular velocity in space is constant, so the attitude matrix R evolves as a 1-parameter group 
R = exp([u]t), so and u are constant. The vector 7(t) describes a circle in the sphere perpendicular 
to u!, and L{t) the curve given by L{t) = (/ + fir'^)uj — uj3j{t). Provided £ is not vertical, L and 7 are 
never parallel. The invariant tori are always foliated by closed curves and the two frequencies coincide. 
Prom the constraint equations we see that the motion of the contact point in the plane is a straight line. 
Shooting pool with a perfect Chaplygin ball is very dull^'^. Let us use these simple results as template for 
our operational system. In terms of the right coframe, we have 



n 



■NH = Mipi + M2P2 + di-sps + hp2P3 + hpsPi + ^3PiP2 - P-r"^ {i^2P3Pi + W1P2/O3) (3.39) 

This formula holds in general. In the nonhomogeneous case one must write uji and L02 in terms of i and 
R G 50(3): uj = RQ. = RQ^{R~^i) which seems to be a quite involved expression, a haunting monster 
we will avoid, until a final confrontation in Proposition 3.8. In the homogeneous case, life is much easier: 



Lu = R^IR- 



m 



rm , so the dependence of a; on disappears. The Hamiltonian is given by 



H 



/ + fir" 



+ 
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To obtain the equations of motion we solve 
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(3.40) 



UJ2 ■ This gives, as 



Thus Ui = mi/I = const , i = 1,2,3 and the vectorfield is simply X = ajiX^^^^ + 0:2X2^^^ + uj^X^^^^^ 
(no components in the fiber directions d/dmi). We now use Theorem 3.4. Using m as coordinates, the 
nonholonomic two form is given by 



fir 

^NH = (1 + — ) (dmipi + dm2P2) + dm-sps + {mip2P3 + m2P3Pi + m^P\P2) 



^^We found the following relevant information in www. ot.com/skew/five/myths. html (Top Ten Myths in Pool or the Laws 
of Physics Do Apply): "4. If the cue is kept level, contacting the cueball purely left or right of its center will make it curve 
as it rolls. (No! The rolling cue ball can have two completely independent components to its angular momentum. Basically, 
this means that it can rotate in the manner of a top while rolling slowly forward along a straight line. In general, spin on a 
cue ball is of two types; follow/draw is the spin like tires on a car, while English is the spin like a child's toy 'top'. Seperately, 
neither one will make a ball curve! If they are combined - e.g., strike low-left giving left English and draw - then the spin is 
called masse (mass-ay), and the ball will curve as it travels.)" 
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^2 



SO that d^l]\fH = —^{dmip2P3 + dm2Pzpi) ■ It is easy to see that the equation dQ.NH = ^nh A a has no 
solution whatsoever. Indeed, suppose a = Aidrrii + A2dm2 + A^dm^ + Bipi + B2P2 + -B3/33 . Taking the 
exterior product, and looking at terms like dmidm2P2 we see that all the A's must be zero. Examining the 
coefficient of P1P2P3 we get Bimi + B2m2 + B^m-s = so all the B^s are also zero. 

Hence the homogeneous Chaplygin sphere, as simple at it can be, has no conformal symplectic structure! 
In fact, it does not have an affine symplectic structure either. A short calculation shows that 

2 

pT 

ixdVlMH = -pr(-dmim2P3 + dmip2m3 - dm2m3pi + dm2P3mi) / 0. 

By continuity, for sufficiently close but different inertia coefficients the inequalities persists. We have 
also done the calculation for the non homogeneous case and things only get worse. But, it still remains a 
possibility: is the reduced system to r*^^ Hamiltonizable? The impatient reader can go directly to Theorem 
3.8. 

3.4 Chaplygin's marble: reduction to T*S'^ 

Using (1.8), L = a X 7 + ^3 7, Chaplygin's marble equations in (L,7)-space directly reduce to T*S'^: 

7 = 7 X 17 , d = -2H 7 + (7, f]) • (a X 7 + ^3 7) (3.41) 

with 

n = n{a,r,i3)=A-'L + pr' ^\^^'^,\ , . 

1 — pr'^lj, A ^) 

reduction of the homogeneous sphere to T*S'^. The homogeneous Chaplygin sphere when reduced 
to r*^^ , produces a more interesting system. Equations (3.41) become 

1 1 , ,2 / N 

7 = — ' a = u;3aX7-— i^\a\ (3.42) 

1 + pr^ I + pr'^ 

One observes that (a, 7) = and that |ap is conserved. So at each level set, we get an isotropic 3d-oscillator 
with a Lorentz force^^. 



Dimension count argument. In hindsight, we can give two simple arguments why the Chaplygin marble 
could not be Hamiltonizable at the T*S0{3) level. First: if {T*SO{3),0,iyf{jH) were Hamiltonizable, the 
system would be Liouville integrable by "mere" symmetries, due to the existence of three independent 
first integrals H,i3,£f + But it is known that integrability of Chaplygin's marble stems not from 

symmetries, but from a special choice of separating coordinates (Duistermaat [2000]), namely, elliptic 
coordinates of the sphere. Second: Stanchenko [1985] verified that Chaplygin's density function F (3.31) of 
the system in also gives an invariant measure on T*SO{3) (see also Duistermaat [2000], section 7), 

'^T*soi3) = F{^)d\id\2d\3dLidL2dL3 , F = [I - pr\j, i-S)]"'/'. (3.43) 

Were the compressed system Hamiltonizable in T*SO{3), the conformal factor (time reparametrization) 
1 

should be ^(7)™-!, with m = 3, see (3.13). But the correct time reparametrization holds with m = 2 
instead of m = 3. This strongly suggests that Hamiltonization should be attempted after reduction of the 
internal symmetry. 

^''Alan Weinstein commented in more than one occasion that "unreduction" sometimes is even nicer than reduction: unre- 
ducing a non-trivial system may lead to an trivial one. Alan credits this moto to Guillemin and Sternberg; one reference could 
be Guillemin and Sternberg [1980]. 
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Phase locking. The fact that Chaplygin's sphere is integrable implies an interesting phase locking prop- 
erty. For simplicity, consider a resonant torus and a periodic solution, 7(T) = 7(0) , L{T) = L{0). We may 
assume that R{0) = identity so R{T) preserves both k and £. If we assume i 7^ it/c, then R(T) must also be 
the identity (there is only one orthogonal matrix with two different eigenvectors with equal eigenvalues 1). 
Since the rotational conditions are reproduced after time T, there is a "planar geometric phase" (meaning 
a translation), Az = (Ax, Ay). From Duistermaat [2000], section 11, one knows this direction: 

Proposition 3.7. In average, Az moves in the direction of i x k. 

In the normal direction kx(ixk) there is a "swaying motion" , with zero average, see Duistermaat [2000] , 
(11.71), and remark 11.11. This result depends on the explicit solution in terms of elliptic coordinates, but 
the zero average can be proved in a more elementary way, see Duistermaat, section 8.2. In the direction 
i X k one has 

^ iz{t),i X k) = r {u X k , I X k) = r {uj , i - isk) = r {2T - i^u^ ) > . 

Duistermaat [2000] shows (section 9.2) that by a suitable change of coordinates, one may assume that 
^3 = 0, so in this equivalent problem, the velocity in this direction is simply 2rT. 

Chaplygin's marble via the almost Hamiltonian structure. After this detour, we hope the reader 
will appreciate a concise way to describe this system. The clockwise map is 

T(SO(3)xiR2) — , r*(50(3) X iR2) (0,i,y) ^ {M = An, = fi±, Py = py) 
Leg 

T T 

I I (3.44) 

TSO{3) < — T*SO{3) n ^ L 

{Leg^r^ 

where {x,y) = ru> x k , and {Px,Py) = proo x k . We now compute the "gyroscopic" 2-form: 

(J.K) = r{-Pxdp2 + Pydpi) = pr{-xdp2 + ydpi) = -pr'^{uJ2dp2 + OJidpi) (3.45) 

To obtain oji and 002 as functions in T*S'0(3), we use the Legendre transformation: uj = RVL = RA^^M so 
(J.K) is a combination of the basic forms /O3 A pi , p2 A (coefficients linear in M and functions of R). 

invariance. We claim that VInh is 5'^-invariant {S^ acting only in the first factor of Q = 50(3) x IR?). 
For the canonical term this is a standard symplectic fact. The (J-K) term is invariant as well: (3.45), written 
in terms of the left-invariant forms, depends only on the Poisson vector 7: 

(J.K) = pr'^ (7 X (0(L,7) X 7) , dX) (3.46) 

In fact, the action generated by the right invariant vectorfield X^^^^^ maintains the projection 7 fixed. We 
know (general nonsense) that the right invariant vectorfields preserve the left invariant forms: i?^ Aj = Aj . 

(Proof: {R'^Xi){R) = Xi{R^R[Q]) = Jlj). Since under the left action (actually under the left action of 
50(3) on 50(3)) the value of O remains unchanged, the (J.K) term is preserved. 
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The twisted action generator and S reduction. MW reduction method works fine, although is 
the Hamiltonian vectorfield of J = £3, relative to the canonical symplectic form, but not relative to ^nh- 
We just change to the twisted S'^-action generator X3, defined by ^nh = —dis- A simple computation 
gives X3 = X3 — 1712 + rrii where mi = ii — fj.r'^uji , m2 = ^2 — The reduced manifold is 

the quotient of a level £3 in T*SO{3), identifying the flow lines (p. A concrete realization is achieved using 
(1.11). Taking the pullback via i* , the reduced form is then 

Qred = ^T^s^ + ^3 area52 - fir'^{uJi d92 - 002 dOi) (3.47) 

where we recall the parametrizations piOi + P2O2 £ T* S"^ , i?(7) = rows (ei, 62, 7). 

In (3.47) we must write wi, 1x12 explicitly in terms of pi, p2, is- To write this explicitly, there is no other 
option than to confront the monster (which actually is not that terrible): from Q = R~^u> = uJi ei + W2 62 + 
ijjz 7 and from (3.26) we get 

uj = i?(7) [i?(7)]^ ^ , ^ = (P2, -Pi, ^3) . (3.48) 
where 0^ is explicitly given by (3.30). 

Theorem 3.8. ixdif^red) / , 7(7) = [1 - ^ir^{^,A-^^)]-^l^ . 

Proof. We used spherical coordinates {fdute de mieux) and a Mathematica© notebook. It misses being 
conformally symplectic by very little (even in the homogeneous case)^^. 

Our calculation suggests that Chaplygin's sphere is not affine symplectic even at the T*S^ level, so 
Chaplygin's sphere integrability is due to a specific nonholonomic phenomenon. This observation is in 
accordance with the opening statement in Duistermaat [2000]: 

"Although the system is integrable in every sense of the word, it neither arises as a Hamiltonian 
system, nor is the integrability an immediate consequence of the symmetries" . 

4 Recent developments and final comments 

NH systems have a reputation of having peculiar (even rebellious) dynamic behaviour (Arnold, Kozlov, and Neishtadt 
[1988]). In spite of good progress, the general theory for NH systems is way behind the theory of Hamil- 
tonian systems. For instance, although the groundwork for an Hamilton-Jacobi theory for NH systems has 
been set up in Weber [1986], not much has been achieved since then. 

We have no intent (nor competence) to make a survey of recent developments in NH systems, specially 
regarding reduction of symmetries; nevertheless it may be worth registering the intense activity going 
on. Recent books of interest are Cushman and Bates [1997], Cortes [2002], Oliva [2002], Bloch [2003] and a 
treatise in the Mechanical Engineering tradition is Papastavridis [2002]^^. Reports on Mathematical Physics 
has been publishing NH papers regularly, and Regular and Chaotic Dynamics devoted large parts of vols. 
1/2 (2002) to NH systems. For older eastern European literature see PMM USSR, J. Applied Math, and 
Mechanics., which has strongly influenced Chinese mechanicists as well. For a historical account on NH 
systems, from a somewhat "anti-reducionist" perspective, see Borisov and Mamaev [2002a]. 

^^Borisov and Mamaev [2002] showed by a subtle numerical evidence that, in the original time, Chaplygin's marble is 
not Hamiltonizable at any level of reduction. The question whether Chaplygin's marble is Hamiltonizable in the new time 
dt/dr = 7(7) was addressed in Borisov and Mamaev [2001]. They provide a bracket structure in terms of the coordinates (L, 7) 
or the coordinates (I/, 7), with L — L/f{'y). Using a computer algebra program we checked that the second brackets satisfy 
the Jacobi identity. However, we could not recover Chaplygin's equations for the L coordinates, even in the homogenous case. 

^® Reviewed in Koillcr [2003]. 
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4.1 Invariant measures and integrability 

Kupka and Oliva [2001] and Kobayashi and Oliva [2003] find conditions insuring a special, but very inter- 
esting situation, where the Riemann measure in TQ induced by the metric in Q, is an invariant measure 
for the NH system. Invariant measures for systems with distributional symmetries were characterized in 
Zenkov and Bloch [2003]. 

Curiously, although a number of interesting NH systems have been solved using, say, Abelian functions, 
a precise definition for integrability of a NH system is still lacking (Bates and Cushman [1999]). These 
examples suggest that the presence of an invariant measure must be imposed as a necessary (although not 
sufficient) condition for integrability (whatever it may be), see Kozlov [2002]. Most of them have enough in- 
tegrals of motion that the dynamics occurs on invariant 2-dimensional tori. Due to the invariant measure, the 
flow becomes linear in these tori after a time rescaling. This follows from Jacobi's multiplier method and Kol- 
mogorov's theorem (Arnold [1989]). Time reparametrization indicates the possibility of an affine symplectic 
structure. We believe that characterizing NH systems possessing an affine symplectic structure (if needed, 
after some reduction stage) could be an interesting project. As a first step, one may examine the exisying 
literature to see which examples fit. We list a few papers for that purpose: Veselov and Veselova [1988], 
Veselov and Veselova [1986], Fedorov [1989], Cushman, Hermans and Kemppainen [1995], Zenkov [1995], 
Zenkov and Bloch [2000], Dragovic, Gajic and Jovanovic [1998], Jovanovic [2003], Fedorov and Jovanovic 
[2003]. One can hope that the manifestly geometric character of (1.14) can be instrumental to under- 
stand when, where and why Hamiltonization is possible. Moreover, a prior geometric understanding of 
the invariant volume form conditions is a more general question. It would be also interesting to tie the 
"Hamiltonizable" question with the invariants from the Cartan equivalence viewpoint, see below. 

4.2 Nonholonomic reduction. 

The difficulties in reduction for general NH systems are explained in Sniatycki [2002] . There are four current 
theories of reduction of symmety for nonholonomic systems^'': i) projection methods, see Marie [1995], 
Dazord [1994]; ii) the distributional Hamiltonian approach, initiated by Bocharov and Vinogradov [1977] 
and developed in Bates and Sniatycki [1993], Cushman, Kemppainen, Sniatycki and Bates [1995], Sniatycki 
[1998], and Cushman and Sniatycki [2002]. iii) bracket methods, initiated by Mashke and van der Schaft 
[1994], and developed by Koon and Marsden [1998] and Sniatycki [2001]; iv) Lagrangian reduction, see 
Cendra, Marsden and Ratiu [2001]. 

A few other references in this rapidly developing theme, besides those already mentioned are: Bates 
[2002], Koon and Marsden [1997], Cantrijn, de Leon, Marrero and de Diego [1998], Cortes and de Leon [1999], 
Marie [1998], Marie [2003]. 

Almost Poisson, almost Dirac approaches. Mashke and van der Schaft [1994] were the first to de- 
scribe a NH system using an almost-Poisson structure^^, Xi = {xi, H}ms- This bracket, defined on the mani- 
fold P = LegiTi) C T*Q, where Leg : TQ — > T*Q is the Legendre transformation, in general does not satisfy 
the Jacobi identity. They proved that the Jacobi identity holds if and only if the constraints are integrable. 
In Koiller, Rios and Ehlers [2002] we gave a moving frames based derivation of the bracket structure. For 
some recent work on the MS-bracket and also Dirac estructures (the latter introduced in Courant [1990]), 

^^We thank one of the referees for this information. 

■^^Physicists are never shy to use the word "super" in their endeavours; on the other hand we, mathematicians, prefer to use 
low key terminology, like "almost-quasi-twisted-(freakaz-)-'oid's" ; this certainly does not help our image problem with applied 
people, see Papastavridis [2002] and Koiller [2003]. 
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see Cantrijn, de Leon, de Diego [1999], Koon and Marsden [1998], Ibort, de Leon, Marrero and de Diego 
[1999], Clemente-Gallardo, Maschke and van der Schaft [2001]. In spite of these advances, a complete un- 
derstanding of the NH bracket geometry is stih in order^^ . 

4.3 G-Chaplygin systems via affine connections 

Trajectories of the compressed system can be described as geodesies of an affine connection in S 

(Vershik and Fadeev [1981], Koiller [1992]). For background in this approach, see Lewis [1998] and references 
therein. Consider the parallel transport operator along closed curves; if the holonomy group is always 
conjugate to a subgroup of 50(771), then the connection is metrizable. This means that there is a metric 
such that is precisely the Levi-Civita connection of this metric. More generally, one may want to 

know when the geodesies of are, up to time reparametrization, the geodesies of a Riemannian metric. 
This is a traditional area in differential geometry, whose roots go back to the 19th century, and goes under 
the name of projectively equivalent connections (Cartan [1937], Eisenhart [1925], Kobayashi and Nomizu 
[1963], Sharpe [1997]). Grossman [2000] studies integrability of geodesies equations via the equivalence 
method. Our problem, then, is to find conditions for the NH connection to be projectively equivalent to a 
Riemannian connection. It would be also interesting to tie the Hamiltonization question with the canonical 
system and invariants of the Cartan equivalence method. When an internal symmetry group is present, it 
would be desirable to construct a projected connection in S for each set of conserved momenta, and address 
these issues in the reduced level. 

Acknowledgements. Thanks to Hans Duistermaat for informations on Chaplygin's sphere and the referees 
for very good criticims and suggestions. This is also a special occasion to thank our mathematical family: 
Alan, of course; mathematical brothers and sisters from all continents (specially Yilmaz Akyildyz and 
Henrique Bursztyn) and cousins (specially Tudor Ratiu and Debra Lewis), "uncle" Jerry Marsden and, last 
but not least, "grandfather" Chern (with his gentle voice, commanding us to keep interested in Math, up 
to his age). 

We shall toast in many more Alanfests: Madadayo! 
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